[SPARK-13167][SQL] Include rows with null values for partition column when reading from JDBC datasources. #11063
Conversation
There was a problem hiding this comment.
this actually breaks api compatibility
.
There was a problem hiding this comment.
Thank you for reviewing the patch, Reynold.
This particular jdbc method where I made the signature changes is not public. It i defined as private def jdbc ..
|
@rxin Thank you for reviewing the PR. As I mentioned in my comment, I did not change the public method. Any suggestions to improve this fix ? |
There was a problem hiding this comment.
can you add some documentation to this function to explain the parameters?
just do them with @param
There was a problem hiding this comment.
also 4 space indent for function params
…null value partition column rows
f4358bb to
1e6a631
Compare
|
Thanks for input, Reynold . Update the PR to specify the is null clause in the first partition where clause. Please review. |
|
Thanks - can you update the pull request description to reflect the latest change? |
|
sure. Updated the description. |
|
Test build #2599 has finished for PR 11063 at commit
|
|
Thanks - I'm merging this in master. |
|
Test build #2600 has finished for PR 11063 at commit
|
|
Thank you. |
… when reading from JDBC datasources. Rows with null values in partition column are not included in the results because none of the partition where clause specify is null predicate on the partition column. This fix adds is null predicate on the partition column to the first JDBC partition where clause. Example: JDBCPartition(THEID < 1 or THEID is null, 0),JDBCPartition(THEID >= 1 AND THEID < 2,1), JDBCPartition(THEID >= 2, 2) Author: sureshthalamati <suresh.thalamati@gmail.com> Closes apache#11063 from sureshthalamati/nullable_jdbc_part_col_spark-13167.
…olumn when reading from JDBC datasources. apache#11063
Rows with null values in partition column are not included in the results because none of the partition
where clause specify is null predicate on the partition column. This fix adds is null predicate on the partition column to the first JDBC partition where clause.
Example:
JDBCPartition(THEID < 1 or THEID is null, 0),JDBCPartition(THEID >= 1 AND THEID < 2,1),
JDBCPartition(THEID >= 2, 2)